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© A novel retinoic acid receptor is disclosed. The novel receptor is encoded for by cDNA carried on plasmid 
phRARl , which has been deposited with the American Type Culture Collection for patent purposes. Chimeric 
receptor proteins are also disclosed. The chimera are constructed by exchanging functional domains between 
the glucocorticoid, the mineralocorticoid, the estrogen -related, the thyroid and the retinoic acid receptors. In 
addition, a novel method for identifying functional ligands for receptor proteins is disclosed. The method, which 
takes advantage of the modular structure of the hormone receptors and the idea that the functional domains may 
be interchangeable, replaces the DNA- binding domain of a putative novel receptor with the DNA- binding 
domain of a known receptor such as the glucocorticoid receptor. The resulting chimeric construction, when 
expressed in cells, produces a hybrid receptor whose activation of a ligand - (e.g., glucocorticoid) inducible 
promoter is dependent on the presence of the new ligand. The novel method is illustrated in part by showing 
that the ligand for the new receptor protein is the retinoid, retinoic acid. 
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FIELD OF THE INVENTION 

" Th present invention relates generally to ligand - responsive regulatory proteins and genes encoding 
them. More particularly, the present invention relates to retinoid related regulatory proteins and genes 
5 encoding them, modification of these and other regulatory proteins and genes by recombinant DNA and 
other genetic engineering techniques, plus uses of the retinoid related regulatory proteins and genes, both 
unmodified and modified. 

In addition the invention relates to a novel method for identifying functional ligands for ligand - 
responsive proteins. This method is especially useful for identifying functional ligand(s) for newly discovered 
io receptor proteins. The method is exemplified in part by showing that a vitamin A related morphogen, 
retinoic acid, is a functional ligand for a newly discovered retinoid receptor protein. 

BACKGROUND OF THE INVENTION 

75 A central problem in eukaryotic molecular biology continues to be elucidation of molecules and 
mechanisms that mediate specific gene regulation in response to exogenous inducers such as hormones or 
growth factors. Although much remains to be learned about the specifics of such mechanisms, it is known 
that exogenous inducers such as hormones modulate gene transcription by acting in concert with 
intracellular components, including intracellular receptors and discrete DNA known as hormone response 

20 elements or HRE ? s. 

More specifically, it is known that hormones like the glucocorticoid and thyroid hormones enter cells by 
facilitated diffusion, ft is also known that the hormones then bind to specific receptor proteins, thereby 
creating a hormone/receptor complex. The binding of hormone to the receptor is believed to initiate an 
alosteric alteration of the receptor protein. As a result of this alteration, it is believed that the 

25 hormone/receptor complex is capable of binding with high affinity to certain specific sites on the chromatin 
DNA. Such sites, which are referred to in the art by a variety of names, including hormone response 
elements or HRFs, modulate expression (transcription of RNA) of nearby target gene promoters. 

A major obstacle to further understanding the specifics of gene regulation by exogenous inducers such 
as hormones has been the lack of availability of receptor proteins in sufficient quantity and purity to allow 

30 such proteins to be adequately analyzed and characterized. This same lack of availability has thwarted the 
use of receptors in diagnostic assays to determine the presence of exogenous inducers (e.g., the 
hormones) in various body fluids and tissues, as well as their use as "prototypes" for engineering chimeric 
receptor protein analogs. 

In an effort to overcome this lack of availability of receptor proteins, co- pending application U.S.S.N. 

35 108,471, which has been assigned to the Salk Institute for Biological Studies, assignee of the present 
application, discloses cloned genes for a variety of receptor proteins, including glucocorticoid - , thyroid - , 
mineralocorticoid - and new steroid - related receptors. U.S.S.N 108,471 further discloses detailed bio- 
chemical characterization of these molecules which shows that the receptor proteins contain discrete DNA - 
and ligand -binding domains. (Portions of U.S.S.N. 108,471 have been published; for portions relating to 

40 cloning ot the glucocorticoid receptor and characterization of this molecule into discrete domains, see 
i Hollenberg, ef a/. (1985) and Giguere, ef a/., (1986); for other related work regarding receptors, see 
Hoilenberg, ef a/., (1987), Green, et aL (1986). Green and Chambon, (1987), Kumar, ef a/., (1987), 
Miesfeld, ef a/., (1987) and Evans (1988)). USSN 108,471 corresponds to W088- 03168. 

Further with regard to biochemical characterization of the receptors, sequence analysis of the human 

45 glucocorticoid receptor gene revealed homology with the product of the v-erb-A oncogene of avian 
erythroblastosis virus (AEV) (see Weinberger, et a/., (1985)). This group and others subsequently dem- 
onstrated the cellular homolog of v-erb-A to be the beta thyroid hormone receptor (see Weinberger et 
at., (1986) and Sap, ef a/., (1986)). 

The discovery that the DNA -binding domain of the steroid and thyroid hormone receptors is highly 

so conserved raised the question of whether this segment might be diagnostic for related ligand inducible 
transcription factors. It also raised the question of whether the DNA sequences encoding these domains 
might be used as hybridization probes to scan the genome for related, but novel, ligand - responsive 
receptors. Utilizing this approach, our group at the Salk Institute have identified several new gene products. 
As is shown in U.S.S.N. 108,471, one is the human aldosterone receptor (hMR, ATCC No. 67201) (see 

55 Arriza, ef a/., (1987) for the published version of this portion of U.S.S.N. 108,471); a second is a novel 
thyroid hormone receptor expr ssed at high levels in the rat c ntral nervous system (rTR alpha, ATCC No. 
67281) (see Thompson, ef a/. t (1987) for the published version of this portion of U.S.S.N. 108,471). 
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This disclosure describes the isolation and characterization of a cloned full - length cDNA encoding a 
novel retinoid receptor protein with homology to the DNA- binding and ligand - binding domains of the 
steroid and thyroid hormone receptors. In addition the construction and characterization of chimeric 
receptors made by "swapping* functional domains between the glucocorticoid, the mineralocorticoid, the 

5 thyroid, the estrogen - related, and the retinoic acid receptors is described. These chimeric receptors have 
hybrid functional characteristics based on the "origin" of the "parental" DNA -binding and ligand - binding 
domains incorporated within the chimeras. For example, if the DNA -binding domain in the chimeric 
receptor is a retinoic acid receptor DNA -binding domain (i.e., is obtained from wild -type retinoic acid 
receptor or is a mutant that contains the functional elements of retinoic acid DNA -binding domain), then 

to the chimera will have DNA -binding properties characteristic of a retinoic acid receptor. The same is true of 
the ligand - binding domain, If the ligand - binding domain in the chimeric receptor binds to thyroid 
hormone, then the chimera will have ligand - binding properties characteristic of a thyroid hormone receptor. 

This disclosure also describes a new method for identifying functional ligands for ligand - responsive 
receptor proteins. The method is illustrated by showing (1) that the retinoid, retinoic acid and its metabolic 

is precursor, retinol, are functional ligands for the newly discovered receptor protein, and (2) that the DNA- 
and ligand - binding domains determine the functional characteristics of the chimeric receptors. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 The following is a brief description of the drawings. More detailed descriptions are found in the section 
of the specification labeled, "Detailed Description of the Drawings". 

FIGURE 1 (A and B) is a drawing which shows the DNA nucleotide sequence and the primary protein 
sequence of phRARa. Fig. 1A shows the composite structure of phRARa aligned with a line diagram of 
some restriction endonuclease cleavage sites. Figs. 1B-1, 1B-2 and 1B-3 show the complete 

25 nucleotide sequence of phRARa and its primary amino acid sequence. 

FIGURE 2 (A and B) is composed of a drawing and a blot. Fig. 2A is a drawing which illustrates 
construction of the chimeric receptor hRGR. Fig. 2B is a blot which illustrates induction of CAT activity 
by retinoic acid. 

FIGURE 3 (A and B) is composed of two graphs. Fig. 3A is a graph illustrating dose - response to 
30 retinoids. Fig. 3B is a bar graph illustrating retinoic acid binding to cytosol extracts of transfected COS - 
1 cells. 

FIGURE 4 (A and B) shows a Southern blot analysis of human genomic DNA. Fig. 4A shows digested 
human placenta DNA hybridized under stringent conditions; Fig. 4B shows the same DNA hybridized 
under non - stringent conditions. 
35 FIGURE 5 shows a Northern blot analysis of retinoic acid receptor mRNA in rat and human tissues. 
FIGURE 6 is a schematic drawing which shows a comparison of hGR. hRR and hT 3 R0 
FIGURE 7 is a schematic diagram of a generalized steroid\thyroid\retinoic acid receptor gene. 
FIGURE 8 is a schematic drawing that shows amino acid comparison of members of the steroid hormone 
receptor superfamily. 

40 FIGURE 9 is a schematic drawing that shows the structure and activity of chimeric thyroid/glucocorticoid 
feceptors. 

DEFINITIONS 

45 In the present specification and claims, reference will be made to phrases and terms of art which are 

expressly defined for use herein as follows: 

As used herein, the generic term "retinoids" means a group of compounds which includes retinoic acid, 

vitamin A (retinol) and a series of natural and synthetic derivatives that can exert profound effects on 

development and differentiation in a wide variety of systems, 
so As used herein, species are identified as follows: h, human; r, rat; m, mouse; c, chicken; and d, 

Drosophilia. 

As used herein, "steroid hormone superfamily of receptors" refers to the class of related rec ptors 
comprised of glucocorticoid, mineralocorticoid, progesterone, estrogen, estrogen - related, vitamin Da, 
thyroid, v-ert>-A, r tinoic acid and E75 (Drosophilia) rec ptors. Se Evans (1988) and the refer nces 
55 cited therein. 

As used her in, RR and RAR both m an retinoic acid receptor. Th acronyms, hRR and hRAR, mean 
human retinoic acid receptor. The DNA r ferred to as phRARa codes for human retinoic acid receptor 
alpha. hRARa is encoded by deposited phRARI which has been accorded ATCC No. 40392. The DNA 
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referred to as hRAR0 encodes human retinoic acid receptor beta. See Brand et aL (1988). 

As used herein, GR means glucocorticoid receptor. The DNA reterred to as hGR codes tor human 
glucocorticoid receptor GR. hGR is encoded by deposited pRShGR which has been accorded ATCC No. 
67200. 

5 As used herein, MR means mineralocorticoid receptor. The DNA referred to as hMR codes for human 

mineralocorticoid receptor MR. hMR is encoded by deposited pRShMR which has been accorded ATCC 
No. 67201. 

As used herein, TR means thyroid receptor. TRalpha and TRbera refer to the alpha and beta forms of 
the thyroid receptor. The DNAs referred to as c~erb-A t herb- A 8.7, peA101, rbeAl2, and hFA8 att code 

10 for thyroid receptors. Plasmid pherb-A 8.7 encodes hTRa; it has been deposited for patent purposes and 
accorded ATCC No. 40374. Plasmid peAl01 encodes hTR£; it has been deposited for patent purposes and 
accorded ATCC No. 67244. Plasmid rbeA!2 encodes rTRa; it has been deposited for patent purposes and 
accorded ATCC No. 67281. Plasmid phFA8 encodes a partial clone of hTRa that has a deletion in the 
"ligand -binding" region of the clone (i.e., the DNA that codes for the carboxy terminal end of the receptor 

is protein). Plasmid phFA8 has been accorded ATCC No. 40372. 

As used herein, ERR means estrogen - related receptor. The acronyms, hERRl and hERR2 refer to 
human estrogen - related receptors 1 and 2. These receptors are more related to steroid receptors than to 
the thyroid receptors, yet they do not bind any of the major classes of known steroid hormones (Giguere, ef 
a/, 1988). hERRl is encoded by deposited plasmids pE4 and pHKA, which have been accorded ATCC No. 

20 67309 and 67310, respectively. (Neither pE4 or pHKA are complete clones; hERRl is constructed by joining 
segments from both clones.) hERR2 is encoded by deposited plasmid phH3 which has been accorded 
ATCC No. 40373. 

As used herein, VDR means vitamin D3 receptor. 

As used herein, MTV means mammary tumor virus; MMTV means mouse mammary tumor virus. 
25 As used herein, RSV means Rous sarcoma virus; SV means Simian virus. 
As used herein. CA T means chloramphenicol acetyltransferase. 

As used herein, luciferase means firefly luciferase. See, de Wet, I.R., Wood, K.V., DeLuca, M.. Helinski, 
D.R., and Subramani, S., Mol.Cell.Biol. 7: 725 - 737 (1987). 

As used herein, COS means monkey kidney cells which express T antigen (Tag). See Gluzman, Cell, 
30 23:175 (1981). COS cells are receptor - deficient cells that are useful in the functional ligand identification 
assay of the present invention. 

As used herein, CV-1 means mouse kidney cells from the cell line referred to as "CV-1". CV-1 is 
the parental line of COS. Unlike COS cells, which have been transformed to express SV40 T antigen 
(Tag), CV-1 cells do not express T antigen. CV-1 cells are receptor - deficient cells that are also useful in 
35 the functional ligand identification assay of the present invention. 

As used herein, the generic terms of art, "hormone response elements" or "HRE's", "transcriptional 
control units", "hormone responsive promoter/enhancer elements", "enhancer - like DNA sequences" and 
"DNA sequences which mediate transcriptional stimulation", all mean the same thing, namely, short ex- 
acting sequences (about 20 bp in size) that are required for hormonal (or ligand) activation of transcription. 
40 The attachment of these elements to an otherwise hormone - nonresponsive gene causes that gene to 
become hormone responsive. These sequences, referred to most frequently as hormone response elements 
* or HRE's, function in a position - and orientation - independent fashion. Unlike other enhancers, the activity 
of the HRE's is dependent upon the presence or absence of ligand. (See Evans (1988) and the references 
cited therein.) In the present specification and claims, the phrase "hormone response element" is used in a 
45 generic sense to mean and embody the functional characteristics implied by all terms used in the art to 
describe these sequences. 

As used herein, synthetic HRE's refer to HRE's that have been synthesized in vitro using automat d 
nucleotide synthesis machines. Since the HRE's are only about 20 bp in size, they are easily synthesized in 
this manner, if wild -type, engineered or synthetic HREs are linked to hormone - nonresponsive promoters, 
so these promoters become hormone responsive. See Evans (1988) and the references cited therein. 

As used herein, the acronym GRE means glucocorticoid response element and TRE means thyroid 
receptor response element. GRE's are hormone response elements that confer glucocorticoid responsive - 
n ss via interaction with the GR. See Payvar, et a/.. Cell, 35:381 (1983) and Schiedereit, et a/., Nature, 
304:749 (1983). GRE's can be used with any wild -type or chimeric receptor whose DNA -binding domain 
55 can functionally bind (/.e., activate) with the GRE. For example, since GR, MR and PR receptors can all 
activate GRE's, a GRE can be used with any wild -type or chimeric receptor that has a GR, MR or PR- 
type DNA -binding domain. TRE's ar similar to GRE's except that th y confer thyroid hormone respon- 
siveness via interaction with TR. TRE's can b us d with any wild - type or chimeric receptor whose DNA - 

5 

BNSDOC1D: «EP 0540065A1J_> 



EP 0 540 065 A1 



binding domain can functionally bind (i.e., activate) with the TRE. Both TR and RR receptors can activate 
TRE's, so a TRE can be used with any receptor that has a TR or RR - type DNA - binding domain. 

As used herein, ligand means an inducer, such as a hormone or growth substance. Inside a cell the 
ligand binds to a receptor protein, thereby creating a ligand/receptor complex, which in turn can bind to an 

5 appropriate hormone response element. Single ligands may have multiple receptors. For example, both the 
T3 R„ and the T3 R* bind thyroid hormone such as T3 . 

As used herein, the word "operative", in the phrase "operative hormone response element functionally 
linked to a ligand - responsive promoter and an operative reporter gene", means that the respective DNA 
sequences (represented by the terms "hormone response element", "ligand - responsive promoter" and 

70 "reporter gene") are operational, i.e., the hormone response element can bind with the DNA -binding 
domain of receptor protein (either wild -type or chimeric), the ligand - responsive promoter can control 
transcription of the reporter gene (upon appropriate activation by a HRE/receptor protein/ligand complex) 
and the reporter gene is capable of being expressed in the host cell. The phrase "functionally linked" 
means that when the DNA segments are joined, upon appropriate activation, the reporter gene (e.g. CA T or 

75 luciferase) will be expressed. This expression occurs as the result of the fact that the "ligand responsive 
promoter" (which is downstream from the hormone response element, and "activated" when the HRE binds 
to an appropriate ligand/receptor protein complex, and which, in turn then "controls" transcription of the 
reporter gene) was "turned on" or otherwise activated as a result of the binding of a ligand/receptor protein 
complex to the hormone response element. 

20 As used herein, the phrase "DNA -binding domain" of receptors refers to those portions of the receptor 
proteins (such as glucocorticoid receptor, thyroid receptor, mineralocorticoid receptor, estrogen - related 
receptor and retinoic acid receptor) that bind to HRE sites on the chromatin DNA. The boundaries for these 
DNA -binding domains have been identified and characterized for the steroid hormone superfamily. See 
Figure 8; also see Giguere, et aL (1986); Hollenberg. et a/., (1987); Green and Chambon (1987); and 

25 Miesfield. et aL (1987), Evans (1988). 

The boundaries for the DNA -binding domains for various steroid hormone superfamily receptors are 
shown in Figure 8; the boundaries are as follows: 

-hGR (pRShGR): nucleotide 1393 to 1590 amino acid 421 to 486 (See ATCC #67200) 

-hTRb (peA101): nucleotide 604 to 807 amino acid 102 to 169 (See ATCC #67244) 

30 -hTRa (pherb-A8.7): nucleotide 168 to 372 amino acid 50 to 117 (See ATCC #40374) 

amino acid 291 to 358 (See ATCC #67281) 

- ERR1 (pE4 & pHKA): amino acid 176 to 241 (See ATCC #67309 & #67310) 

- ERR2 (phH3): amino acid 103 to 168 See ATCC #40373) 

-hMR (pRShMR): nucleotide 2029 to 2226 amino acid 603 to 668 (See ATCC #67201) 

35 - hRARa (phRARI): nucleotide 364 to 561 amino acid 88 to 153 (See ATCC #40392) 

The DNA -binding domains of the steroid hormone superfamily of receptors consist of an amino 
segment varying between 66 to 68 amino acids in length. This segment contains 9 cysteine residues, one 
of which is the first amino acid of the segment. This first Cys residue begins a motif described as Cys- 
X2 -Cys-Xi 3 -15 -Cys-X2 - Cys, where X is any amino acid residue. The DNA -binding domain invari - 
40 ably ends with -the amino acids Gly - Met. 

,-^or convenience in the cloning procedure, between 1 and 6 amino acids residues preceding and/or 
following the DNA -binding domain can be switched along with the DNA -binding domain. 

As used herein, the phrase "ligand -binding domain region" of receptors refers to those portions of the 
receptor proteins that bind to ligands such as growth substances or the hormones. These boundaries of the 
45 ligand - binding domains for the steroid receptor superfamily have been identified and characterized. See 
Figur 8 and Evans (1988). 

The ligand - binding domains for the various receptors are shown in Figure 8; some of those domains 
are as follows: 

- hGR (pRShGR): amino acid 528 to 777 (See ATCC #67200) 
50 -hTRb (peA101): amino acid 232 to 456 (See ATCC #67244) 

-hTRa (phert>-*8.7): amino acid 183 to 410 (See ATCC #40374) 
-rTR (rbeA12) amino acid 421 to 639 (See ATCC #67281) 

-ERR1 (pE4 & pHKA): amino acid 295 to 521 (See ATCC #67309) 
-ERR2 (phH3) amino acid 212 to 433 (See ATCC #40373) 

55 - hMR (pRShMR): amino acid 734 to 984 (See ATCC #67201) 

-hRARa (phRARI): amino acid 198 to 462 (See ATCC #40392) 

Common restriction endonuc lease sit s must be introduced into receptor cDNA clones to allow 
exchange of functional domains between rec ptors. In any of the various receptors referred to in Figur 8, 
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the first common site can be introduced immediately preceding the DNA- binding domain, the second 
common site immediately following it. (For example, in any of the steroid hormone superfamily of receptors 
that are shown in Figur 8, a uniqu Not\ site can be introduced immediately preceding the DNA - binding 
domain and a unique Xho\ site can be introduced immediately following it. This divides the receptors into 

5 three functional regions or "cassettes"; (1) an N- terminus cassette, <2) a DNA -binding domain cassette, 
and (3) a ligand - binding domain cassette. The three regions or cassettes from any one receptor can be 
combined with cassettes from other receptors to create a variety of chimeric receptors. 

As used herein, the nomenclature used to identify the chimeric receptors is as follows: The various * * 
functional domains (N- terminus, DNA -binding and figand - binding) are identified according to the 

io "parental" receptor from which they originated. For example, domains from GR are "G" domains; TR 
domains are "T" domains (unless otherwise further specified as being "T a " or "T b " domains); MR domains 
are "M" domains; RAR domains are "R" domains ( unless otherwise further specified as being "R a " or R b " 
domains), and ERR domains are "E" domains (unless otherwise specified as being "E, " or "E2 W 
domains). According to this notation, unless otherwise specified, "T" is used generically to mean either the 

75 T3R* or the T 3 R e receptors; "E" means either hERR1 or hERR2; and "R" means either the RARa or the 
RAR0 receptors. Wild -type receptors do not contain any exchanged domains, and so according to this 
notation system would be identified as G-G-G (or GGG), T a -T a -T a (or T a T a T a ), T b -T b -T b (or 
T b T b T b ), M-M-M (or MMM), R a -R a -R a (or R a R a R a ), R b -R b -R b (or R b R b R b ), Ei - E, -Ei or E 2 -E 2 - 
E 2 , where the first domain listed is the N- terminus domain, the middle domain is the DNA -binding 

20 domain, and the last domain is the ligand - binding domain. Any chimeric receptor will have functional 
domains from at least two wild -type or parental sources. For example, the chimeric receptor GGR a would 
have N-termimus and DNA -binding domains from glucocorticoid receptor and the ligand -binding domain 
from the alpha retinoic acid receptor; 

25 G T ft R b 

would have the N- terminus from glucocorticoid, the DNA -binding domain from thyroid receptor alpha and 
the ligand - binding domain from retinoic acid receptor beta. 
30 As used herein, hGR N x, hTR0 NX , and hRR NX refer to hGR, hTR0 and hRR receptors that have been 
engineered to contain the unique sites for Nott and Xho\ flanking the boundaries for the DNA -binding 
domains in these receptors. These mutant receptors exemplify construction of hybrid receptors that are 
comprised of all possible combinations of amino termini, DNA -binding domains, and ligand - binding 
domains from hGR, hMR, hERR1, hERR2, hTRa, hTR£, rTRa, hRARa, and hRAR0. 
35 As used herein, Southern blot analysis refers to a procedure for transferring denatured DNA from an 
agarose gel to a nitrocellulose filter where it can be hybridized with a complementary nucleic acid. 

As used herein, Northern blot analysis refers to a technique for transferring RNA from an agarose gel to 
a nitrocellulose filter on which it can be hybridized to complementary DNA. 

As used herein, "mutant" DNA of the invention refers to DNA which has been genetically engineered to 
40 be different from the "wild -type" or unmodified sequence. Such genetic engineering can include the 
insertion of new nucleotides into wild -type sequences, deletion of nucleotides from wild -type sequenc s, 
r substitution of nucleotides in the wild -type sequences, or "swapping" of functional domains from one 
receptor to another. Receptors that have been engineered by "swapping" functional domains from one 
receptor to another are also referred to as chimeric or hybrid receptors. Chimeric receptors can be further 
45 engineered to include new nucleotides, deletion of nucleotides, substitution of nucleotides, etc. 

Use of the term "substantial sequence homology" in the present specification and claims means it is 
intended that DNA, RNA, or amino acid sequences which have slight and non - consequential sequence 
variations from the actual sequences disclosed and claimed herein are within the scope of the appended 
claims. In this regard, the "slight and non - consequential" sequence variations mean that the homologous 
so sequences will function in substantially the same manner to produce substantially the same compositions as 
the nucleic acid and amino acid compositions disclosed and claimed herein. 

As used herein, the term "recombinants produced" means made using genetic engineering techniques, 
not merely purified from nature. 

The amino acids which comprise the various amino acid sequences appearing herein may be identified 
55 according to the following three - letter or one - letter abbreviations: 
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r\l Mil ikJ r\\^i\J 


y n i-g0 — Letter Abbreviation 


One — Letter 
Abbreviation 




L - Alanine 


Ala 


A 


5 


L - Arginine 


Arg 


R 




L - Asparagine 


Asn 


N 




L - Aspartic Acid 


Asp 


D 




L - Cysteine 


Cys 


C 




L - Glutam ine 


Gin 


Q 


10 


L - Glutamic Acid 


Glu 


E 




L - Histidine 


His 


H 




L - Isoleucine 


lie 


1 




L - Leucine 


Leu 


L 




u Lysine 




K 


15 


L - Methionine 


Met 


M 




L - Phenylalanine 


Phe 


F 




L - Proline 


Pro 


P 




L - Serine 


Ser 


S 




L - Threonine 


Thr 


T 


20 


L - Tryptophan 


Trp 


W 




L - Tyrosine 


Tyr 


Y 




L - Valine 


Val 


V 



The nucleotides which comprise the various nucleotide sequences appearing herein have their usual 
single - letter designations (A, G, T t C or U) used routinely in the art. 
As used herein, bp means base pairs and kb means kiiobase pairs. 

In the present specification and claims, the Greek letters alpha (a), beta (0), etc. are sometimes referred 
to as a, b, etc. 

DEPOSITS 



Plasmids pRShGR (hGR), pRShMR (hMR), peA101 (hT 3 0) and GMCA T, all of which are in E.co// 
HB101, plus plasmids rebA12 (rTRa), pE4 and phKA (which together encode hERRt), phH3 (hERR2), 
phenb-A 8.7 <hTRa), phFA 8 (a partial clone of hTRa), and plasmid phRARI have been deposited at the 
American Type Culture Collection, Rockville, Maryland, U.S. A. (ATCC) under the terms of the Budapest 
Treaty on the International Recognition of Deposits of Microorganisms for Purposes of Patent Procedure 
and the Regulations promulgated under this Treaty. Samples of the plasmids are and will be available to 
industrial property offices and other persons legally entitled to receive them under the terms of said Treaty 
and Regulations and otherwise in compliance with the patent laws and regulations of the United States of 
America and "all other nations or international organizations in which this application, or an application 
claiming priority of this application, is filed or in which any patent granted on any such application is 
granted. 

The ATCC Deposit Numbers and Deposit Dates for the deposits are as follows: 



so 
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75 



pRShGR (hGR) 


67200 


Sept. 9, 1986 


pRShMR (hMR) 


67201 


Sept. 9. 1986 


pE4 (hERR1*) 


67309 


Jan. 30, 1987 


phHKA (hERRV) 


67310 


Jan. 30, 1987 


phH3 (hERR2) 


40373 


Sept. 29, 1987 


GMCAT (reporter) 


67282 


Dec. 18, 1986 


pherb-A 8.7 (hTRa) 


40374 


Sept. 29, 1987 


phFA 8 (hTRa*) 


40372 


Sept. 29, 1987 


peA101 (hTRb) 


67244 


Oct. 22, 1986 


prbeA12 (rTRa) 


67281 


Dec. 18, 1986 


phRARa (hRARa) 


40392 


Nov. 20, 1987 



(* means a partial clone) 

<pE4 & phHKA together encode complete hERR1) 
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In one aspect, the present invention comprises a double - stranded DNA segment wherein the plus or 
sense strand of the segment contains a sequence of triplets coding for the primary sequence of a protein 
which has ligand - binding and DNA -binding (or transcription - activating) properties characteristic of a 
retinoid receptor protein referred to herein as human retinoic acid receptor protein. According to this aspect 
of the invention, the double - stranded DNA segment is one which is capable of being expressed into 
retinoic acid receptor protein. 

In another aspect, the invention comprises a single - stranded DNA, which is the sense strand of a 
double - stranded DNA coding for retinoic acid receptor protein, and an RNA made by transcription of this 
double - stranded DNA. 

In another aspect, the invention comprises a plasmid, phRARI, which contains DNA coding for a 
retinoic acid receptor protein of the present invention (RARa). This plasmid has been deposited with the 
American Type Culture Collection for patent purposes; it has been accorded ATCC No. 40392. 

In still another aspect, the invention comprises a cell, preferably a mammalian cell, transformed with a 
DNA coding for retinoic acid receptor protein. According to this aspect of the invention, the transforming 
DNA is capable of being expressed in the cell, thereby increasing the amount of retinoic acid receptor, 
encoded by this DNA, in the cell. 

Further the invention comprises novel retinoic acid receptors made by expression of a DNA coding for 
retinoic acid receptor or translation of an mRNA transcribed from such a retinoic acid receptor coding DNA. 
According to this aspect of the invention, the retinoic acid receptors will be protein products of "unmodified" 
retinoic acid coding DNA's and mRNA's, or will be modified or genetically engineered retinoic acid receptor 
protein products which, as a result of engineered mutations in the receptor DNA sequences, will have one 
or more ''differences in amino acid sequence from the corresponding naturally occurring "wild -type" 
retinoic acid receptor proteins. Preferably these retinoic acid receptors, whether "unmodified" or 
"engineered", will have at least about 5% of the retinoic acid binding activity and/or at least about 5% of the 
DNA -binding or transcription - activating activity of the corresponding naturally occurring retinoic acid 
receptor. 

Further the invention comprises chimeric receptors made by exchanging the functional domains of one 
receptor with functional domains of another type. The chimeric DNA*s thus produced encode chimeric 
receptor proteins that have functional characteristics based on the "origin" of their respective DNA- and 
ligand - binding domains. The chimeric receptors of the invention include double - stranded DNA's that code 
for the chimeric receptors, as well as single - stranded DNA T s which are the sense strands of the double - 
stranded DNA's, and mRNA's made by transcription of the double - stranded DNA f s. The invention also 
comprises cells, both eukaryotic and prokaryotic, that are transformed with chimeric receptors encoding 
DNA f s of the invention. 

According to the chimeric receptor aspect of the invention, to effect the chimeric DNA fusions, two 
restriction endonuclease sites are introduc d into each receptor cDNA at comparable locations in or near 
the DNA -binding domains in order to divide the receptor DNA's into three functional domains or regions. 
(For xample, a unique Nofl site can be introduced immediately preceding the DNA - binding domain and a 
unique Xhc\ site can be introduced imm diately following it This divides the receptors into three functional 
regions or "cassettes"; (1) an N- terminus cassette, (2) a DNA -binding domain cassette, and (3) a ligand - 
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binding domain cassette. The three regions or cassettes from any one receptor can be combined with 
cassettes from other receptors to create a variety of chimeric receptors. This aspect of the invention is 
illustrated in the section of the specification labeled "Detailed Description of the Invention".) 

In the present specification and claims, the chimeric receptors (referred to also as chimera or hybrids) 
s are named by letters referring to the origin of the various domains. Domains from hGR are referred to as 
"G" domains, domains from hTR are T" domains, domains from hERR are "E" and domains from hRR are 
"R" domains. For example, the chimeric receptor "RGR" has the amino and carboxyl termini of hRR and 
the DNA -binding domain of hGR; the chimeric receptor "TGG" has the amino terminus from hTR, and the 
DNA -binding and carboxyl terminus from hGR. (In the diagram shown in Figure 7, the amino terminus of 
io the receptor is referred to domain A/B and the carboxyl terminus is referred to as domain E.) 

According to the notation used in the specification and claims, unless otherwise specified, "T" is used 
generically to mean either the T 3 R„ or the T 3 R 5 receptor; "E" means either hERRt or hERR2; and "R" 
means either the RARa or the RAR£ receptor. 

Chimeric receptors of the invention include chimera having (1) an N- terminus domain selected from 
75 the group of wild -type receptors consisting of hGR, hMR, hERRi , hERR 2 , rTRa, hT 3 a, hT 3 £, hRARa and 
hRAR£, (2) a DNA -binding domain selected from the group of wild -type receptors consisting of hGR, 
hMR, hERRi, hERR 2 , rTRa, hT 3 a, hT 3 0, hRARa and hRAR£, and (3) a ligand - binding domain selected 
from the group of wild -type receptors consisting of hGR, hMR, hERRi, hERR 2 , rTRa, hT 3 a, hT 3 £, hRARa 
and hRAR£, wherein any one chimeric receptors will have N- terminus, DNA -binding, and ligand - binding 
20 domains that originate from at least two different "wild -type receptor" sources. 

Preferred chimeric receptor DNA's of the invention include GRR, GRG, GGR, RGG, RGR, RRG, GTT, 
GTG, GGT, TGG, TGT. TTG, TTR, TRT, TRR. RTT, RTR, RRT, GTT, GTG, GGT, TGG, TGT, and TTG 
receptor DNA's, plus the chimeric hybrid receptor proteins made by expression of a chimeric DNA of the 
invention translation of an mRNA transcribed from such a chimeric receptor coding DNA. Preferably these 
25 chimeric receptors will have activity that exceeds exogenous background binding or transcriptional activa - 
tion activity levels in any given cell, or will have at least about 5% of the DNA - binding or transcription - 
activating activity of the corresponding naturally occurring receptor DNA -binding domain, and/or about 5% 
of the ligand - binding activity of the corresponding naturally occurring ligand - binding domain. 

The invention also comprises a method for identifying functional ligand (s) for receptor proteins. 
30 According to the method, DNA sequences (referred to herein as the sample sequences) can be isolated 
which code for receptor proteins and which have at least an operative portion of a ligand - binding domain 
and a DNA -binding domain. (As those skilled in the art will appreciate, not aJ) of the DNA sequences in the 
ligand - binding domains are necessary in order for the domains to be functional. The operative sequences, 
i.e., those that must be present if the domain is to bind ligand, can be identified by deletion studies on any 
35 given domain.) Once the sample DNA sequences are isolated, a chimeric gene can be created by 
substituting the DNA -binding domain region in the sample DNA sequence with a DNA -binding domain 
region taken from a DNA sequence coding for another receptor protein, e.g., glucocorticoid receptor protein, 
thyroid receptor protein, mineralocorticoid receptor protein or retinoic acid receptor protein. Next a suitable 
receptor - deficient host cell is transfected with: 
40 (1) the chimeric receptor gene, which is preferably carried on an expression plasmid, and 

,42) a reporter gene, such as the CAT gene or the firefly luciferase gene, which is also preferably carried 
on plasmid, and which is refered to in U.S.S.N. 180,471 as a reporter plasmid. In any case, the reporter 
gene is functionally linked to an operative hormone response element (HRE) (either wild -type or 
engineered) wherein the hormone response element is capable of being activated by the DNA -binding 
45 domain used to make the chimeric receptor gene. (For example, if the chimeric receptor gene contains 
the DNA -binding domain region from glucocorticoid receptor coding DNA. then the HRE should be a 
wild -type, an engineered, or a synthetic GRE, i.e., one that can be activated by the operative portion of 
the DNA -binding region of a glucocorticoid receptor protein, ff a thyroid receptor DNA -binding domain 
region is used, then the wild -type or engineered HRE should be responsive to a thyroid (or retinoic 
so acid) receptor protein, etc.) Next the transfected host cell is challenged with a battery of candidate 
ligands which can potentially bind with the ligand -binding domain region of the chimeric protein coded 
for by the chimeric gene. To determine which of these ligands can functionally complex with the 
chimeric receptor protein, induction of the reporter gene is monitored by monitoring changes in the 
protein levels of the prot in coded for by the reporter gene. (For example, if luciferase is the reporter 
55 gene, the production of luciferase is indicative of receptor - regulated gene transcription.) Finally, when a 
ligand(s) is found that can induce transcription of the reporter gene, it is concluded that this ligand(s) can 
bind to the receptor protein coded for by the initial sample DNA sequence. This conclusion can be 
further verified by testing the binding properties of the receptor protein, coded for by th initial sample 
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DNA sequences, ws-a- vis the ligand(s) that induce expression of the reporter gene. 

As those skilled in the art will appreciate, if a cell already contains (a) a chimeric DNA sequence (C) 
comprised of (1) operative portions of a DNA -binding domain of a first receptor sequence (/.e. t a first 
sequence) linked to (2) operative portions of a ligand - binding domain of a second receptor sequence {i.e., 
s a second sequence), and (b) a reporter nucleic acid sequence functionally linked to an operative hormone 
response element wherein the operative portions of the DNA - binding domain of the first receptor sequence 
can functionally bind to and activate the hormone response element that is functionally linked to the reporter 
sequence, then the method for identifying a functional ligand for a receptor protein will be comprised of 
challenging the cell with at least one candidate ligand and then monitoring induction of the reporter 
70 sequence by means of changes in the amount of expression product of the reporter sequence. 

The new functional ligand identification assay makes it possible to screen a large number of potential 
ligands or any given receptors, regardless of whether the receptor is a wild -type receptor or a chimeric 
one. 

The functional ligand identification method is illustrated herein by showing (1) that the retinoid, retinoic 
75 acid and its metabolic precursor, retinol, are functional ligands for the receptor protein coded for by 
phRARl DNA, and (2) that the DNA- and ligand - binding domains determine the functional characteristics 
of the chimeric receptors. 

The new functional assay, as well as the new retinoic acid receptor and the new chimeric receptors, are 
described more fully below. 

20 

DESCRIPTION OF THE INVENTION 
The Retinoic Acid Receptor 

25 In a continuing effort to explore the steroid hormone receptor superfamily. advantage was taken of the 
fortuitous identification of a novel genomic sequence with striking homology to the DNA -binding domain of 
the steroid hormone receptors (see Dejean et a/., 1986). This sequence spans the integration site of a 
hepatitis B virus (HBV) from a human hepatocellular carcinoma. 

To pursue the hypothesis that this gene might code for a previously unknown receptor, an 

30 oligonucleotide derived from this sequence was labeled and used to probe a number of human cDNA 
libraries. Five positive clones were initially isolated from a testis cDNA library. The insert from one of these 
clones (1hTlR) was used to isolate additional cDNA clones from a Xgt10 kidney cDNA library. A restriction 
map of the largest clone (phRARl) is shown in Figure 1A. Nucleotide sequence analysis reveals a long 
open reading frame of 462 amino acids beginning with a presumptive initiator methionine codon cor- 

35 responding to nucleotides 103-105 as shown in Fig. 1B-1. The sequence surrounding this ATG agrees 
with the consensus described by Kozak (1987) for a translation initiation site. Upstream of the ATG is an 
in -frame terminator providing support for the initiator methionine. Another methionine found 30 codons 
downstream fails to conform to the consensus and is an unlikely initiator. Following the terminator codon at 
position 1489-1491 is a 3* - untranslated region with a consensus polyadenylation signal (AATAAA) found 

40 20 nucleotides upstream of a polyadenylated tract (see Proudfoot, et aL, 1976). 

^ A polypeptide of relative molecular mass 50,772 d (51 Kd) is encoded within the translational open 
reading frame. The size of the protein encoded by the insert of phRARl was verified by in vitro translation 
of RNA (see Krieg, et a/., (1984)) derived from this insert and found to correspond to the predicted size of 
54 Kd (data not shown). Amino acid sequence of this protein has been compared to the glucocorticoid and 

45 thyroid hormone receptors. The highest degree of similarity is found in a cysteine - rich sequence of 66 
amino acids beginning at residue 88. Our group has previously demonstrated that this region of the hGR 
represents the DNA -binding domain for this receptor. See Giguere, ef a/., (1987) and Hollenberg, et a/., 
(1987). In addition, mutagenesis and expression studies have provided direct evidence for its role in 
transcriptional activation of genes harboring glucocorticoid response elements (GREs). See Giguere, et a/., 

so (1987) and Hollenberg, et aL, (1987). 

Domain Switching and Transcriptional Activation 

Since the ligand for the gene product of phRARl was unknown, it was desirable to develop a quick and 
55 sensitive assay to reveal its identity. Previous studies have demonstrated that the DNA -binding domain of 
the human glucocorticoid and estrogen r ceptors can be interchanged to yield a functional hybrid rec ptor. 
This chimera recognizes the glucocorticoid responsive element of the MMTV - LTR but stimulates tran - 
scription in an estrogen - dependent fashion (see Green, et al., (1987)). This led us to wonder is if a g neral 
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domain - swapping strategy could be exploited to identify the ligand - binding properties of a novel hormone 
receptor. To test this approach we first substituted the DNA- binding domain of the phRARt gene product 
with the well described DNA -binding domain from the hGR (Fig. 2A). (This chimeric construction, when 
expressed in suitable host celts, produces a hybrid receptor protein whose ligand - binding domain region 

5 must bind with a functional exogenous ligand before the ligand/receptor complex can bind to a GRE, 
thereby activating a glucocorticoid inducible promoter.) 

To assay for the presence of a functional ligand the chimeric receptor gene was transfected into 
suitable host cells along with a suitable GRE linked reporter gene. CV - 1 cells were used for the assay 
along with a MMTV - CA T reporter gene. (MMTV- CAT is carried on reporter plasmid, GM-CAT. which 

to has been deposited with the American Type Culture Collection for patent purposes; see the section of this 
specification labeled, "Deposits". As those skilled in the art will appreciate, reporter ptasmids suitable for 
assaying hybrid thyroid receptor proteins, i.e., hybrid proteins having the DNA -binding domain of a thyroid 
receptor protein, can be constructed by substituting the GRE on plasmid GM- C>AT with a thyroid hormone 
responsive transcription element. For example, the growth hormone promoter can be functionally linked to 

75 the bacterial CAT gene. Since the growth hormone promoter contains a thyroid responsive transcription 
element, such a reporter plasmid can be used to assay hybrid thyroid receptor proteins. See the 
subheading: "Construction of Reporter and Expression Plasmids" in this specification. (Since mineralocor - 
ticoid receptors can activate GRE's, a reporter plasmid such as GM- CAT can be used to assay hybrid 
mineralocorticoid receptor proteins.) 

20 Returning to the functional ligand identification assay, the transfected cells were then systematically 
challenged with a battery of candidate ligands and induction monitored by changes in CAT activity. 

Because of their hormonal - like activities, the retinoids, including retinol (Vitamin A) and retinoic acid, 
were evaluated as potential inducers. Remarkably, retinoic acid elicited a dramatic increase in CAT activity 
of the hybrid receptor (Fig. 2B). No effect upon CAT activity was observed using the parent vector, 

25 pRShRRwx, or the wild type gene product from phRARI, herein referred to as human retinoic acid receptor 
(hRARa). As expected, the hybrid receptor is not induced by glucocorticoids, and the hGR is not induced 
by retinoic acid. 

As shown in Figure 3A, retinoic acid exhibits an ED50 value of 6 x 1Q~ 10 M on CAT activity induced by 
the hybrid receptor, which is consistent with EDso values observed for retinoic acid in a variety of biological 

30 assays (see Spom and Roberts, 1984). Retinol functions as a weak agonist with an ED50 value greater than 
100 nM. Retinyl acetate and retinyl palmitate function as even weaker inducers. A number of natural and 
synthetic ligands including testosterone, dihydrotestosterone, estrogen, dexamethasone, Cortisol, al- 
dosterone, progesterone, T 3 , T4, Vitamin D3 and 25 - OH - cholesterol failed to induce CAT activity. 

To corroborate the identity of the phRARI gene product as the retinoic acid receptor, the binding 

35 properties of the expressed product were evaluated following transfection of COS-1 cells. As shown in 
Figure 3B, transfected cells reveal increased capacity to specificalry bind 3 H- retinoic acid. This increase 
occurs over an endogenous background that is a likely consequence of the presence of cellular retinoid 
binding proteins as well as a significant non-specific binding. Consistent with the activation studies, the 
binding is fully competed by retinoic acid but only partially by retinol. Thyroid hormones, dexamethasone 

40 and vitamin Drdid not compete the binding of retinoic acid. 

A Gene Family 

To determine if the new retinoic acid gene was unique and to identify potentially related genes, human 
45 DNA was examined by Southern blot analysis. Hybridization of restriction endonuclease - digested human 
DNA with a labeled DNA fragment derived from the coding region of the hRR gene produced three bands in 
ev ry digestion consistent with a single hybridizing genetic locus (Fig. 4A). This hybridization pattern is 
unrelated to the restriction endonuclease map described by Dejean ef a/. (1986) for the HBV pre- 
integration site. However, when the hybridization conditions were relaxed, six additional bands were 
so observed in the products of each enzyme digestion (Fig. 4B). These observations suggested that there were 
at least one additional locus, and possibly more, in the human genome related to the retinoic acid receptor. 
The RAR£ has now been found. See Brand, ef a/, (1988). 

Expression of the hRR Ge ne 

55 

Since retinoic acid is known to exert effects on a large number of different c II types, we examined the 
expression of the hRR gene. Total cytoplasmic RNAs isolated from a variety of rat and human tissues w re 
size fractionated and transferred to a nitrocellulose fitter. Hybridization with a 600 - bp r striction fragment 
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from phRARl reveals a major RNA species of 3,200 nucleotides with highest levels in the hippocampus, 
adrenals, cerebellum, hypothalamus and testis (Fig. 5). Longer exposure shows that most tissues contain a 
small amount of the 3.2 kb transcript while it is undetectable in some tissues such as liver. 

s Retinoic Acid Receptor Data Summary 

The data disclosed herein identify the gene product of phRARl as a human retinoic acid receptor 
based on three criteria. First, the overall structural homology of the hRR to steroid and thyroid hormone 
receptors (Fig. 6) suggests that it is likely to be a ligand - responsive regulatory protein. Second, an 
io expressed chimeric receptor, consisting of the DNA- binding domain of the hGR and the presumptive 
ligand - binding domain of the hRR acts as a transcriptional regulator of a glucocorticoid - inducible reporter 
gene only in the presence of retinoic acid. This induction occurs at physiological levels. Third, expression of 
the candidate hRR in transfected cells selectively increases the capacity of those cells to bind retinoic acid. 

is Development and Oncogenesis 

The retinoids comprise a group of compounds including retinoic acid, retinol (vitamin A) and a series of 
natural and synthetic derivatives that together exert profound effects on development and differentiation in a 
wide variety of systems. See Sporn & Roberts, (1983); Mandel & Cohen, (1985); Wolback & Howe, (1925); 

?o Lotan (1980); and Fuchs & Green, (1980). Although early studies focused on the effects of retinoids on 
epithelial growth and differentiation, their actions have been shown to be more widespread than previously 
suspected. Many recent studies demonstrate the effects of these molecules on a variety of cultured cell 
lines including neuroblastomas (see Hausler. et aL, (1983)). melanomas (see Lotan, et a/., (1983)) and 
fibroblasts (see Shroder et a/., (1982)). In the human promyelocyte leukemia cells (HL-60), retinoic acid is 

25 a potent inducer of granulocyte differentiation (see Breitman, et a/., (1980)). In F9 teracarcinoma stem cells, 
retinoic acid will induce the differentiation of parietal endoderm, characteristic of a late mouse blastocyst 
(see Strickland & Mahdavi, (1978); Jetten et a/., (1979); and Wang ef a/., (1985)). Retinoic acid has been 
shown to exert equally potent effects in development. For example, in the developing chick limb bud, 
retinoic acid is able to substitute for the action of the polarizing region in establishing the anterior - posterior 

30 axis (see Tickle & Eichele, (1985)). By controlling the exposure to retinoic acid, it is possible to generate 
novel patterns of limb structures. Although retinoic acid is primarily considered a morphogen, Northern blot 
analysis suggests a re - evaluation of its function in the adult. In humans, retinol deficiency has been linked 
to an alarming increase in a variety of cancers (see Moon & Itri, (1984)). Retinoids have also been shown to 
inhibit tumor progression in animals and block the action of tumor promoters in vitro. In this context, the 

35 hRR may be considered as a negative regulator of oncogenesis. 

A Superfamily of Regulatory Genes 

Two surprising results have emerged from the studies presented here. The first is the discovery of a 

40 family of retinoic acid receptor - related genes which predicts the existence of one or more other proteins 
^with closely related properties (e.g., the RAR0 described by Brand ef a/, (1988)). Physiological studies 
demonstrate that both retinoic acid as well as retinol (vitamin A) can exert potent effects on cellular 
differentiation and that these effects are often not linked. It thus seems likely that at least one related gene 
product might be a specific retinol receptor or a receptor for another member of the retinoid family. The 

45 second surprising observation from these results is the close kinship of the retinoid receptor with the thyroid 
hormone receptor. (As we show below, the retinoic acid receptor can activate a thyroid response element or 
TRE; see the section of the specification labeled "Retinoic Acid and Thyroid Hormone Induce Gene 
Expression Through a Common Response Element".) This relationship is surprising in part because of the 
structural dissimilarity of the thyroid hormones and the retinoids. Thyroid hormones being derived from the 

so condensation of two tyrosine molecules whereas, the retinoids are derived from mevalonic acid. The 
observation that chemically distinct molecules interact with receptors sharing common structures most lik ly 
reflects a common mode of action with which they elicit their particular regulatory effects. Based on this 
analogy, we can now propose that the interaction of retinoids with their intracellular receptors induces a 
cascade of regulatory events that results from the activation of specific sets of genes by th 

55 hormone/receptor complex. Although animals employ diverse means to control their development and 
physiology, the demonstration that the retinoic acid receptor is part of the steroid receptor superfamily 
suggests that mechanisms controlling morphogenesis and homeostasis may be more universal than 
previously suspected. 
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Construction and Characterization of Chimeric Receptors 

Construction of chimeric receptor genes is discussed above in the sections of the specification labeled 
"Definitions", "Summary of the Invention" and "Domain Switching and Transcriptional Activation". In the 
5 sections that follow, construction and characterization of the chimeric receptors is illustrated by showing 
construction and and characterization of GR/TR hybrids. 

Materials and Methods 

Celi Culture and Transfection 

10 

CV-1 cells were used as the receptor - deficient host cells that were transfected with expression 
plasm ids that carry the chimeric RR/TR receptors, and reporter plasmids carrying the CA T reporter gene. 
Conditions for growth and transfection of CV-1 (African Green monkey kidney) cells were as previously 
described (Giguere et aL (1986)), except that the calcium phosphate precipitate was left on the cells for 4- 

75 8 hours, at which time the media was changed to DMEM with 5% T 3 free bovine serum (Scantibodies) 
minus or plus 10~ 7 M T 3 (Sigma). Cells were harvested 36 hours after the addition of T 3 , and CA T assays 
were performed as described by Gorman et aL (1982). Typically, 5 ug reporter and 1 ug expression vector 
were cotransfected, along with 2.5 ug RSV-0gal as a control for transfection efficiency. Acetylated and 
non-acetylated forms of [ u ^chloramphenicol were separated by thin layer chromatography, excised, and 

20 quantitated by liquid scintillation counting in Econofluor (DuPont) with 5% DMSO. £ - galactosidase assays 
were performed as described by Herbomel et al. (1984). CAT activity is expressed as percent conversion 
divided by £ - galactosidase activity. 

Construction of Reporter and Expression Plasmids 

25 

Synthetic oligonucleotides corresponding to -169 to -200 of the rat growth hormone gene was 
inserted into a linker scanning mutant of MTV- CA7that has a HincflU site at position -190/- 181 (Buetti 
and Kuhnel (1986)). Expression vectors were constructed for the thyroid hormone receptors by inserting the 
full-length cDNAs of pheA12 (hTR0, see Weinberger, et at. (1986)) and rbeA12 (rTRa, see Thompson, ef 
30 al. (1987)) between the Kpn\ and BamH\ sites of the pRS vector (Giguere, et aL (1986) and (1987)). 

Construction of Chimeric Receptors 

The construction of hGRnx has been described (Giguere, et al. (1987). To construct hTR£ N x. the cDNA 
35 insert of pheA12 (hTR/S, see, Weinberger, et at. (1985) and (1986)) was subcloned between the Kpn\ and 
Ba/nHI sites of M13mp19 and mutagenized by the method of Kunkel (1985). The oligonucleotide used to 
create the Nott site changed three amino acids: Asp97 to Arg, Lys98 to Pro, Asp99 to Pro. The 
oligonucleotide used to create the Xhc\ site changed two amino acids: Thr171 to Leu, Asp172 to Gly. The 
mutant receptor cDNA was then transferred to the expression vector pRS (Giguere, et al. (1986) and 
40 (1987)); hybrids were constructed by exchanging Kpn\-No% Kpn\-Xhol t or Not\-Xhd\ restriction frag- 
ments between RShGRwx and RShTR/Swx- RShGRwx has about 75% of wild -type DNA- binding activity, 
and RShTR^ has about 60% of wild - type DNA - binding activity. 

The Cis/Trans Functional Ligand 
45 Identification Assay 

The cis/trans functional ligand identification cotransfection assay was used to study chimeric receptors 
constructed by swapping domains between the glucocorticlod, the thyroid and the retinoic acid receptors. 
(As those skilled in the art will appreciate, the cis/trans cotransfection assay can be used to study chimeric 

so receptors made by swapping functional between any of the wild -type or genetically engineered receptors.) 
In the cis/trans assay, preferably two plasmids are transfected into a receptor deficient cell line. The first 
plasmid is used to express the receptor protein (whether wild -type, chimeric or genetically engineered). 
The second plasmid is used to monitor transcription from a ligand or hormone responsive promoter. For the 
thyroid hormone receptor assay, the xpr ssion plasmid consists of the Rous Sarcoma Virus long terminal 

55 repeat (RSV-LTR) directing the expression of a cDNA encoding a thyroid hormone receptor. For the hGR, 
the reporter plasmid is the mouse mammary tumor virus long terminal repeat (MTV-LTR) fused to the 
bacterial chloramphenicol acetyltransferas (CAT) gene. To convert MTV -CAT to a thyroid hormone 
responsive reporter, an oligonucleotide containing a thyroid hormone response element (THE) was inserted 
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at position -191 of the MTV-LTR. This sequence, -169 to -200 of the rat growth hormone (rGH) gene, 
specifically binds thyroid hormone receptors and can confer T3 responsiveness to a heterologous promoter 
(Glass et al. (1987)). Expression and reporter plasmids were cotransfected into CV-1 cells and CAT 
activity was masured in th absenc and presence of T 3 . The assays showed that neither the alpha nor 

5 the beta thyroid hormone receptor activates transcription from MTV -CAT, in the absence or presence of 
T3. (Data not shown.) However, the addition of a TRE produces an MTV promoter that is thyroid hormone 
responsive. Induction of CAT activity is dependent on the cotransfection of a functional alpha or beta 
thyroid hormone receptor and the addition of T3. In the presence of T3, the alpha receptor (rTRa) induces 
CA T activity approximately 15 -fold, while the beta receptor (hTR/S) induces activity by about 5 -fold. 

70 The hybrid thyroid hormone/glucocorticoid receptors were constructed to compare the functional 
properties of the thyroid and glucocorticoid hormone receptors. To facilitate the construction of the chimeric 
hybrid receptors, unique sites for the restriction enzymes Atofl and Xho\ were inserted flanking the DNA 
binding domains of hGR and hTR£. These mutant receptors, termed hGR NX and hTR0 NX , can be used to 
create hybrids with all possible combinations of amino termini, DNA -binding domains, and ligand- binding 

15 domains for these receptors. (As those skilled in the art will appreciate, comparable plasmids, such as 
pRARa NX or pMRwx for example, can be used to create chimeric receptors consisting of all possible 
combinations of all functional domains from the various receptors in the steroid hormone receptor 
superfamily. The receptors and the locations of the various functional domains are shown in Figure 8.) The 
hybrid and parental receptors were assayed using both thyroid hormone and glucocorticoid responsive 

20 promoters, in the absence or presence of T3 or the synthetic glucocorticoid dexamethasone. 

The structures and activities of the hybrid thyroid/glucocorticoid receptors are shown in Figure 9. The 
receptors are divided into three sections, and hybrids are named by letters referring to the "origin" of the 
domain; for example, "T-G-T" has the amino and carboxyl termini of hTR£ (T-.-T) and the DNA 
binding domain of the hGR (-G-). Hybrids with a putative hTRtf DNA binding domain (TTG, GTT, GTG) 

25 activated transcription from TRE - CA 7, while hybrids with an hGR DNA binding domain (GGT, TGG. TGT) 
activated transcription from GRE-CA7. This demonstrates that this region of hTR£ is analogous to the hGR 
DNA binding domain and is responsible for promoter recognition. Hybrid receptors with an hTRfl carboxyl 
terminus were activated by T3, while those with an hGR carboxyl terminus were activated by dex- 
amethasone. This is consistent with the identification of the carboxyl terminus as the part of the receptor 

30 that is responsible for hormone binding and activation specificity. Taken together, the functional properties 
of these hybrids support the assignment of the DNA -and ligand - binding domains of hTR0. 

Retinoic Acid and Thyroid Hormone Induce Gene Expression Through a Common Responsive Element 

35 Identification of a functional retinoic acid responsive element (RARE) is crucial to our understanding of 
the mechanisms by which retinoic acid receptors activate gene expression and regulate cell differentiation. 
One impediment to such a study is the absence of any identified gene whose transcription is directly 
dependent on the retinoic acid receptor - hormone complex. An alternative approach to localize a RARE is 
to systematically challenge the inducibility of known hormonally responsive promoters with retinoic acid 

40 receptor prgduced from cloned cDNA. (As discussed above under the heading "The Cis/Trans Assay", in 
this system, transcriptional activation from a promoter containing a HRE is dependent on expression of 
functional receptor from cotransfected expression plasmids in receptorless cells such as CV - 1 .) Because 
the DNA -binding domains of the retinoic acid and thyroid hormone receptors are highly related (62% 
identical in their amino acid sequences, see Figure 6), the possibility that the retinoic acid receptor could 

45 activate gene expression through a TRE was investigated. 

TRE's are known; see, for example, Glass, et al, (1987) for a discussion of a cis- acting element in the 
rat growth hormone 5' flanking genomic sequence that is necessary for thyroid hormone (3,5,3' - triiodo - 
L- thyronine, T3) regulation. 

To test if a TRE could effectively function as a RARE, a novel T 3 responsive promoter was constructed 

so by replacing the glucocorticoid responsive elements present in the mouse Mammary Tumour Virus -Long 
Terminal Repeat (MTV-LTR) with an oligonucleotide encoding the natural TRE G h- This promoter was th n 
fused to the bacterial chloramphenicol acetyl transferase (CAT) gene to generate the reporter plasmid 
A MTV - TREqh ~ OA 7. After transient transfection into CV-1 cells, the inducibility of the promoter was 
determined by measuring CAT activity. When CV-1 cells are cotransfectd with the expression vector 

55 containing a human thyroid hormone receptor beta (pRShT 3 R0) and the reporter plasmid AMTV -TREqh ~ 
C^7, induction in CAT activity is observ d in the presence of T 3 . In contrast, cotransfection of an 
expression vector encoding the human glucocorticoid receptor (pRShGRa) and the same reporter plasmid 
did not stimulate CAT activity from this promot r in response to the synthetic glucocorticoid dex- 
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amethasone. These results clearly demonstrate that the induction of CA T activity by RARa is conferred by 
the TRE because the wild - type MTV - LTR construct was not responsive. (Data not shown.) These results 
also show that the hRARar can specifically induce gene expression from a promoter containing a TRE- 

s RAR and GR Chimeric Receptors 

As discussed above, the modular structure of steroid hormone receptors makes it possible to exchange 
functional domains from one receptor to another to create functional chimeric receptors. This strategy was 
used to create hGR/hRARa chimera that had the RAR DNA- binding domain and the GR ligand - binding 
w domain. When CV-1 cells were cotransfected with the expression plasmid encoding hGRG and the 
reporter A MTV - TRE GH - CA T t dexamethasone specifically elicited CAT activity. (Data not shown.) This 
experiment provided direct evidence that the DNA -binding domain of the hRARa determined the specific- 
ity of target gene activation. 

/5 DETAILED DESCRIPTION OF THE DRAWINGS 

FIGURE 1. DNA and primary amino acid sequence of phRARI. A, Schematic representation and 
restriction enzyme map of the phRARI clone. The stippled box represents the predicted open reading 
frame. B. (shown as B-1, B-2 and B-3) The complete nucleotide sequence of phRARI is shown with the 

20 amino acid sequence given above the long open reading frame. An upstream inframe stop codon at 
nucleotides 85-87 and polyadenylation signal are underlined. 

Figure 1 Methods. A 63-mer oligonucleotide corresponding to nucleotides 408-477 of the genomic 
sequence published by Dejean et a/. (1986) was used as a hybridization probe to screen a human testis 
XgtiO library. The hybridization mixture contained 35% formamide. 1X Denhardfs, 5X SSPE, 0.1% sodium 

25 dodecyl sulfate (SDS), 100 ug ml" 1 denaturated salmon sperm DNA and 10 6 c.p.m. ml" 1 of 32 P- labeled 
oligonucleotide. Duplicate nitrocellulose filters were hybridized at 42 # C for 16 h, washed three times for 20 
min each in 2X SSC. 0.1% SDS (1XSSC = 150 mM NaCI, 15 mM sodium citrate) at 55 *C and 
autoradiographed at -70*C with an intensifying screen. Clone IhTIR obtained from this screening was 
partially characterized and then used as a hybridizing probe to screen a human kidney <i>gt10 cDNA library 

30 (see Bell, ef a/., (1986)). For this screening, the washing conditions were modified to 1XSSC with 0.1% 
SDS at 68 *C. Several cDNA clones were isolated and the longest clone, phRARI, was digested with a 
number of restriction enzymes and the resulting fragments were subcloned in both orientations into the M13 
sequencing vectors mp18 and mpl9 and sequenced by the dideoxy procedure (see Sanger, ef a/., 
(1977)). DNA sequences were compiled and analyzed by the programs of Devereux ef al. (1984) and 

35 Staden (1982). 

FIGURE 2. A, Construction of the chimeric receptor hRGR. The domain - structure of the various 
constructions are shown schematically, the numbers correspond to the amino acid positions of each 
domain. The DNA - binding domains are represented by "DNA" and the ligand - binding domains by their 
respective inducers. The Atofl and X/)01 sites created by site - directed mutagenesis to permit the 

40 exchange of the DNA -binding domains between receptors are indicated. B, Induction of CAT activity by 
retinoic acid. The expression vectors were cotransfected into CV-1 cells with the reporter plasmid 
MTVCA T and cultured for 2 days in absence or presence of 100 nM dexamethasone (DEX) or retinoic acid 
(RA). The receptor inserted into the expression vectors are: pRShGR, human glucocorticoid receptor; 
pRShRR, human retinoic acid receptor; PRShRR,,*, mutated human retinoic acid receptor with A/of1 and 

45 Xhol sites; pRShRGR, chimeric receptor composed of the human retinoic acid receptor which DNA- 
binding domain has been replaced by the human glucocorticoid receptor DNA -binding domain. 

Figure 2 Methods A, Restriction enzyme fragments of the cDNA inserts of phRARI and hGR (see 
Hollenberg, et a/., (1985) were subcloned into the Kpm and BamW sites of the mp19 vector and 
mutagenized according to the method of Kunkel (1985). The oligonucleotides used for the creation of the 

so A/ofl site within hGR and hRR were 28 and 31 nucleotides respectively, while the oligonucleotides used for 
th creation of the Xho^ site within hGR and hRR were 24 and 23 nucleotides. The creation of the A/on site 
resulted in the mutation of Pro*i6 to an Arg residue in hGR^x, and in the mutation of Ilea 4 and Tyr 8 s to Pro 
residues in hRR^. The introduction of the Xfroi sit did not alter the hGRwx amino acid sequenc but 
resulted in th mutation of Lysiss to a L u residu in hRR^. The mutant receptors were then transferred to 

55 the expression vector pRS (see Giguer , t a/., (1986), and the Not^tXho^ restriction fragm nt of 
pRShGRwx containing the hGR DNA -binding domain was introduced into pRShRnx between th Not\ and 
Xho^ sites to create pRShRGR. 8, Cell transfection and CAT assay. The recombinant DNA constructs (5 ug 
each) were introduced into CV-1 c lis by calcium phosphate coprecipitation (see Wigler, et a/., (1979)). 
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The ceils were then cultured for two days in serum free media supplemented with Nutridoma (Boehringer 
Mannheim) in presence or absence of inducers. CV - 1 cells were then prepared for CA T assays as 
described by Gorman, et a/. (1982) and the assays performed for 3 h using 25 ug of protein extract. All 
experiments with retinol were conducted in subdued light. 

5 FIGURE 3. A, Dose - response to retinoids. CV-1 cells cotransfected with pRShRGR and pMTVC^Af 

were treated with increasing concentrations of retinoids or a single 1 uM dose O of testosterone, 
dihydrotestosterone, estrogen, Cortisol, aldosterone, progesterone, triiodothyronine (T3), thyroxine (T*), 
dihydroxy - vitamin D 3 (VD3) and 25 - OH - cholesterol. The levels of CA T activity were plotted as percent- 
ages of the maximal response observed in this experiment. B, Retinoic acid binding to cytosol extracts of 

10 transfected COS-1 cells. Bars represent bound 3 H - retinoic" acid determined in absence (black bars) or 
presence (stippled bars) of a 1 000 - fold excess of various competitors. The values represent the mean of 
quadruplicate determinations. Competitors are retinoic acid (RA), retinol (R), T* , dexamethasone (DEX) and 
vitamin D 3 (VD 3 ). 

Figure 3 Methods A CV-1 cell cotransfections and CAT assays were performed as described in Figure 
15 2. Retinoic acid was dissolved in a minimum volume of dimethyl sulfoxide and diluted in ethanol All other 
products were diluted in ethanol and control cultures received 0.1% solvent (v/v) in media. Dose - response 
curves of retinoid treatment were performed in triplicate. B, Subconfluent COS-1 cells were transfected 
with 10 ug/dish of a control plasmid (pRS) or pRShRR by The DEAE-Dextran method (see Deans, et a/., 
(1984)). Cells were maintained for 2 days in DMEM with 5% charcoal - treated fetal calf serum, then 
20 harvested in TNE (40 mM tris-HCl pH 7.5, 150 mM NaCI, 1mM EDTA) and lysed by Dounce homogeni- 
zation in hypotonic buffer (50 mM tris-HCl pH 7.4, 0,1 mM EDTA, 5 mM dithiothreitol, 10 mM NaMoCX. 
10% glycerol, 0.5 mM phenyimethylsulfonyl fluoride) and centrifuged at 100,000 X g for 30 min to yield the 
cytosol fraction. Incubations were performed in hypotonic buffer with 150 ug of protein from the cytosolic 
fraction and 2 X 10~ 8 M 3 H- retinoic acid (NEN. 52.5 Ci/mmole) in a total volume of 200 ul. Specific 
25 binding was measured by the addition of 2 X 1 0~ 5 M of competitors. Reactions were carried out at 4 * C for 
16 h. Bound 3 H- retinoic acid was quantitated using DE-81 filters. Reactions were placed on filters for 1 
min and then rinsed with 5 ml of washing buffer (50 mM tris-HCl pH 7.4, 0.1 mM EDTA. 0.1 % Triton X- 
100). niters were dried and counted by liquid scintillation spectrophotometry. 

FIGURE 4. Southern blot analysis of human genomic DNA. A, Human placenta DNA was digested with 
30 the indicated restriction enzymes. After separation of the digested DNA in a 0.8% agarose gel (10 ug/lane) 
and transfer to nitrocellulose filters (see Southern. (1975), the blots were hybridized with an EcoRl X Pvuil 
fragment from phRARI ("600 bp) encompassing the DNA -binding domain of the hRR under high 
stringency conditions (50% formamide, 5X SSPE, 1X Denhardt's, 0.1% SDS, 100 ug ml -1 salmon sperm 
DNA). The filter was washed in 0.1X SSC, 0.1% SDS at 65 °C. Lambda Hind\\\ DNA markers (size in Kb) 
35 are aligned to left of the autoradiogram. B, Analysis of human placenta DNA using the same probe as in A 
under non - stringent conditions. A parallel blot containing identical samples was hybridized as in A, except 
that 35% formamide was used. The filter was washed in 2XSSC. 0.1% SDS at 55 *C. ~~ 
FIGURE 5. Northern blot analysis of retinoic acid receptor mRNA in rat and human tissues. 
Figure 5 Methods. Total RNA was isolated from various tissues using guanidine thyocyanate (see 
40 Chirgwin, et a/., (1980), separated on 1% agarose- formaldehyde gel, transferred to nitrocellulose, and 
hybridized under stringent conditions using the probe described in Fig. 4. Twenty ug of total RNA was used 
f in all lanes. Migration of ribosomal RNA's (28S and 18S) are indicated for size markers. The nitrocellulose 
filter was autoradiographed at - 70 * C with an intensifying screen for 1 week. 

FIGURE 6. Schematic amino acid comparisons of the hGR, hRR and hT 3 R0 structures. Amino acid 
45 sequences have been aligned schematically with the percentage amino acid identity for each region of 
homology in the intervals between dotted lines. 

FIGURE 7 is a schematic diagram of a generalized steroid\thyroid\retinoic acid receptor gene, showing 
the division of the gene into regions A/B. C, D, and E. The function of the A/B region is just beginning to be 
elucidated; the C region encodes the DNA - binding domain; the D region is believed to be a hinge region; 
50 and the E region encodes the ligand - binding domain. 

FIGURE 8 is a schematic drawing that shows amino acid comparison of members of the steroid 
hormone receptor superfamily. Primary amino acid sequences have been aligned on the basis of regions of 
maximum amino acid similarity, with the percentage amino acid identity indicat d for each region in r lation 
to the hGR (Miller et a/., (1985). Domains shown are: a domain at the NH 2 - terminal nd that is required for 
55 "maximum activity"; the 66- to 68 -amino acid DNA -binding domain core ("DNA"); and the 250 -amino 
acid ligand -binding (or hormone -binding domain) ("Hormone"). The amino acid position of each domain 
boundary is shown. Amino acid numbers for all receptors represent the human forms with the exception of 
verb- A and E75 (Segraves, 1988). Functional assignments have been determined by characterization of 
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the glucocorticoid and estrogen receptors. Designations are as follows: GR, glucocorticoid receptor; MR 
mineralocorticoid receptor; PR, progesterone receptor; ER, estrogen receptor; ERR1 or ERR2, estrogen - 
related 1 or 2; VDR. vitamin Cb receptor; and T 3 R* and T3R«, thyroid hormone receptors. The ( + ) or (-) 
indicates whether a particular property has been demonstrated for the products of cloned receptor cDNA or 

5 with purified receptor. HRE, hormone response element. This relates to whether the binding site has been 
identified structurally and whether its enhancement properties have been demonstrated by gene transfer 
studies. For PR, DNA- binding properties have been shown only with the native purified receptor. 
"Hormone binding in vitro" indicates whether this property has been demonstrated by translation in a rabbit 
reticulocyte lysate system (Hollenberg et at, 1985). "Hormone binding in vivo" refers to expression of the 

w cloned receptor in transfected cells. "Chromosome" indicates the human chromosome location. Species are 
as follows: h, human; r, rat; m, mouse; c, chicken; and d, Drosoph/Iia. 

FIGURE 9. Structure and activity of chimeric thyroid/glucocorticoid receptors. 

Figure 9 Methods. To construct hybrid receptors, unique Notl and Xhol sites were inserted flanking the 
DNA binding domains of the hGR and hTR0. Hybrids were created by exchanging the appropriate 

75 segments of the receptor cDNA's. "DNA" indicates the DNA binding domain; n T$fT+ n and "Cortisol" indicate 
the ligand binding domains of hTR0 and hGR respectively. The numbers above the boxes indicate amino 
acid residues. Hybrids are named by letters referring to the origin of the domain; for example, "TGT" has 
the amino and carboxyl termini of hTR0 and the DNA binding domain of the hGR. All receptors were 
assayed on TRE-M CAT and GRE-M CAT in the absence and presence of T 3 and the synthetic 

20 glucocorticoid dexamethasone ("dex"). All of the combinations shown gave activation above background. 
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SPECIFICATION SUMMARY 

From the foregoing description, one of ordinary skill in the art can understand that the present invention 
provides substantially pure DNA which encodes the retinoid receptor protein referred to as retinoic acid 
55 receptor protein. The invention also provides a plasmid containing retinoic acid receptor DNA. This plasmid, 
phRARI , has been deposited with the American Type culture Collection for patent purposes. 

The invention is also comprised of retinoic acid receptor proteins, including modified functional forms 
thereof, expressed from the DNA (or mRNA) of the invention. 
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In addition to novel retinoic acid receptor DNA, RNA and protein compositions, the present invention 
includes chimeric hybrid receptors made by exchanging (1) the N- terminal domains, (2) the DNA -binding 
domains, and (3) the ligand - binding domains from hGR, hMR, hERR1, hERR2, TaR,,, T 3 R£, RARa, and 
RAR0 receptors with one another. The chimeric receptors so constructed have DNA - binding domain and 
5 ligand - binding domain characteristics similar to the DNA - binding domain and ligand - binding domain 
characteristics of the respective "parental" receptors from which they originated. 

Finally, the present invention involves a bioassay for determining the functional ligand s for receptor 
proteins, both wild - type and chimeric. 

The phRARI DNA of the invention can be used to make the retinoic acid receptor proteins, and 
io functional modified forms thereof, in quantities that were not previously possible. The same is true of the 
chimeric receptors. With the quantities of receptor protein available as a result of the present invention, 
detailed studies can be made of both the ligand/receptor complexes and the ligand/receptor/HRE com- 
plexes. In addition, an adequate supply of the retinoic acid receptor proteins means that they can now be 
used to screen compounds for retinoic acid receptor - agonists or retinoic acid receptor - antagonist activity. 
75 Availability of the receptor proteins also means that they can be used in diagnostic assays to determine the 
levels of retinoic acid present in various tissues and body fluids. 

Without departing from the spirit and scope of this invention, one or ordinary skill can make various 
changes and modifications to the invention to adapt it to various usages and conditions. As such, these 
changes and modifications are properly, equitable, and intended to be, within the full range of equivalence 
20 of the following claims. 

Claims 

1. DNA sequences encoding chimeric receptors which (receptors) have an N -terminus domain, a DNA- 
25 binding domain, and a ligand - binding domain wherein the N - terminus domain, the DNA -binding 

domain and the ligand - binding domain originate from known receptors selected from the generic 
group of parental receptors consisting of GR, MR, TR and ERR, wherein any one chimeric receptor will 
have an N- terminus domain, a DNA -binding domain, and a ligand - binding domain that originate 
from at least two different "parental" sources, thereby never being identical to any wild -type receptor. 

30 

2. DNA sequences encoding chimeric receptors according to Claim 1 selected from the group consisting 
of GTT, GTG, GGT, TGG, TGT, and TTG. 

3. Chimeric receptor proteins made by expression of a chimeric DNA according to Claim 1 or 2 or by 
35 translation of an mRNA transcribed from such a chimeric receptor coding DNA wherein said chimeric 

receptors have activity that exceeds endogenous background binding or transcriptional activation 
activity levels in any given cell, or will have at least about 5% of the DNA - binding or transcription - 
activating activity of the corresponding naturally occurring receptor DNA -binding domain, and/or about 
5% of the ligand - binding activity of the corresponding naturally occurring ligand - binding domain. 

40 " 

4^. A method for identifying functional ligands for chimeric receptor proteins, said method comprising: 

(a) introducing into a suitable receptor -deficient host cell: (1) a chimeric gene encoding a receptor 
according to Claim 3, and (2) a reporter gene functionally linked to an operative hormone response 
element wherein the hormone response element is capable of being activated by the DNA -binding 

45 domain region of the chimeric receptor protein; 

(b) challenging the host cell from step (a) with a battery of candidate ligands which can potentially 
bind with the ligand - binding domain region of the chimeric receptor protein; 

(c) monitoring induction of the reporter gene; 

(d) identifying as a functional ligand(s) that ligand(s) which is capable of inducing production of the 
so protein product of the reporter gene. 

5. A method according to Claim 4, wherein the known ligand -r sponsive receptor protein is selected 
from the group consisting of glucocorticoid receptor, mineralocorticoid receptor, human thyroid rec p - 
tors alpha and beta and rat thyroid r ceptor alpha, estrogen - related receptors hERR1 and hERR2. 

55 

6. A method according to Claim 4, wh rein the host cell is a COS cell. 
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7. A method according to Claim 4 (a) (2) wherein the reporter gene is selected from the group consisting 
of a chloramphenicol acety transferase (CAT) gene and a firefly luciferase gene. 

8. A method according to Claim 4 (a) (2) wherein the hormone response element is selected from the 
5 group consisting of wild -type, (1) glucocorticoid response element, (2) thyroid response element, (3) 

mineralocorticoid response element, (4) estrogen - related response element, (5) retinoic acid response 
element, and (6) vitamin D3 response element. 

9. A method according to Claim 8 wherein the glucocorticoid response element is part of the mammary 
10 tumor virus long terminal repeat sequence (MTV LTR), and the thyroid response element is part of the 

growth hormone promoter sequence. 

10. A method according to Claim 4 wherein the ligand(s) identified in step (d) are corroborated by 
evaluating the binding properties of the chimeric receptor and the ligand(s) identified in step (d). 

75 



20 



25 



30 



35 



40 



45 



50 



55 



21 

8NSDOCID: <EP 054O065A1_l_> 



EP 0 540 065 A1 



CO 

Id O03-. 



Ulfr2)I blus- 



(1012) IH ujeg- 



(0881) I nAd- 



O (9691)1 eius- 
— C62frl)HnAd 

(6081)1 eujsH 

C99U)IlSd 
C9frOL)inAd 



C9C8)inAd- 

(9lZ)inAd. 
(t99)IOBS- 

C889)IJSd" 



C890iud>|- 
Id oog- 



CD O 



*0 o 
— o 
< o 



CO 







>* < 












O t-> 


o 




k. o 


o 




Q_ O 






O H- 


o 




fc- o 


u 




a. u 


o 




— o 












> o 


>— 




o o 








»— 




a. o 






w O 


< 




>* < 






»— H- 


< 




>s O 


o 




— o 








<J 




C 1— 


f— 






O 




25 


o 




3 O 


o 




a> h— 


o 




-J o 


o 




CO o 






— < 


< 




X o 


a 






< 




— o 


o 










>s O 


< 




— o 


o 






a 




o 


AGC 




— CJ 

(J o 
O I- 


s 




w o 






a. o 

w < 


O 






1— 




o 3 


o 






>- 




a. o 


C3 




CO o 


C3 




>s O 


O 




O I— 


O 




w O 


GA 




CO o 
CO H- 


CGAG 




rSer 
CAGC 






CD O 


o 




CO < 


o 




c o 


o 




CO -< 






< < 


5 




w o 






cp O 






CO < 


O 




<o o 


o 










58 


o 






< 




CO h- 


o 




s < 


o 




o 


o 




o 






C3 


o 




1— 






»— 


»— 




o 


o 










o 


< 




o 


o 






o 




o 






>— 





o < 




co O 




fc- (J 




< 




a. o 




-J < 




< 












>K < 




< 




*— >— 








CO o 




co o 




— h- 




CO < 




— < 












>* < 




*- o 




^ 




< o 




>^ < 




o o 




— o 








o o 




Cw O 








3 < 




a> o 




CO »— 




CO < 


CO 


*J o 




— 




O I— 




lO »— 




w o 




> <J 




Q. O 




O < 




o o 




w O 




o 




a. o 




a. c> 




3 h- 




o < 




a> >— 




i_ o 








CL. o 




c o 








— < 




CO o 








CO 




CO o 




o o 




— < 




k. O 








CL. O 




c o 




O 




— < 




i» o 








cw t> 




3 O 




^» cj 








CO O 








CO <f 








o c > 
w 








» t % 




^ ■< 












™~ O 




r— 
















3 9 




n ^ 




C9 












_ ^ 




<o 




3 S 




— o 




















3 ^ 




— C9 




_ ^ 












O <C 




w K 








CO 




Q_ C_> 




CO 1* 




O C3 




i_ 




v. O 




co o 




a. o 




CO ^ 




i- o 




1- <-> 




4> O 




CO C9 




CO H 




CO <C 




3 <-> 




w O 




CO 1- 




CO o 




-1 O 








>* < 




^ s 




— O 




— < 




O C3 
















— 13 








O C9 








3 O 








O H- 








-t O 








~ <3 




CD 




© »— 




— H- 


CO 


2 < 


CO 


— < 




O »- 








u- O 




-c o 




a. o 




H- < 




o o 




<a o 




i- o 




— c_> 




a. o 




< C3 




a> o 




O < 




XZ H- 




o 








a. c_> 




o o 




k. o 








CO o 








CO »— 












GO 












CM 



3NSDOCID: <EP 0540065A1J_> 



22 



EP 0 540 065 A1 














_ ^ 




CD O 




>N O 




O 




CO C3 




>s < 








k. o 




JC o 




1— ^ 








© o 




CO *™ 




3 O 


co 


4) | 


CM 


•J O 












k» 




© O 




CO ^ 




© O 




JC 




IV 




40 O 
















CO < 




< O 




Q. O 




k- O 




»- h- 




3 O 




© H- 




— 1 O 




Q> O 




CO < 




< O 




© 








— < 






CM 


CO < 


CM 


< O 




3 O 




«> 1- 




-J O 




4- »— 




o o 




CO fc- 




— o 




« »- 




> C3 




o> 




k- O 




< y 




o o 








3 <3 




w < 




© o 




to h- 




k. O 




© <3 




CO < 

c o 


r» 


IS 






CM 


ss 




k. o 








< 




k. 1- 




f o 




•- < 




I. o 
















51 







CD 
CM 



3 O 
«> 

-J o 
to CJ 

o 

o »— 
« o 

— o 

< CJ 
•O 

— o 

< o 

CO O 

S3 

3 O 
© I- 
—I o 
3 O 
« 

k. o 
JC o 

S<5 



c o 
— < 

a o 
co < 
< o 
•a o 

<3 



I- < 

3 <-> 
© 

-J O 

«- k. CJ 

m £ o 

W K < 

JC o 

h- < 

« o 



— o 
a o 
o o 

L- O 
CL O 

3 C3 

V H» 

-J O 

C CJ 

— < 

o o 

CO o 

S3 

«D O 

— o 

< C3 
€S O 



*" 2 ° 
c5 5 5 

— CJ 

> a 

v. H- 

£ O 

I- < 

CO CJ 



CD 
CO 



o 

CD 



CM 





C 


CJ 






O 




3 O 






< 






c_> 




0) h- 




C3 


o 






< 




-J O 






<_> 




3 






3 CJ 






o 






< 




9> h" 






< 




O 














fO 


o 
















o 








< 


o 




< 


o 




OL (_> 




c 


o 






. 1— 




— « f M 




CO 


< 




to 










■< 


< 




< 










3 










C O 




€> 


»— 




co 






— ^ 




-J 


o 




< 








J m 


i. 


o 










3 C3 


Of 




o 


CM 


CD 




to 


CD 


C\J 




< 




2 


< 


CO 


-1 O 




3 






3 






O 




o 

-J 


CT 




O 


GA 




2 < 




>» CJ 




3 






a. o 










€> 






« < 




O 






—i 


o 




< O 








O 


o 




— o 




40 


< 




k- 


o 














a. 






> o 






o 




3 










CD 


o 




z> 


»— 




w O 




CO 


1— 




_ 1 


o 




< O 




a> 


o 




3 


o 




a o 




XT 


*— 




CD 


»— 




CO < 




a. 






-J 


o 




< o 






o 




C 


a 




o o 












< 










< 




C3 


o 




a. o 










C 


o 




C <J 




a> 


>— 




to 






— < 




2 


"< 




< 


$ 










o 






o 




3 O 




£ 


o 






o 




— > < 






< 




< 


o 




o c3 










CD 


o 




3 




to 


< 




<C 






«> H— 




< 






IV 


»— 




_j o 




c 


o 


r» 








Q, O 


CD 




< 


^» 




o 


•V 


CO < 


CM 




o 


r> 


< 


o 


CO 


< o 




3 


o 




CD 






c o 






< 




JC 






— < 




O 






a. 






o o 




o 








o 








v_ 


o 














Q. 






> 


C5 




< o 




w 






3 






Q. O 






o 




€> 






CO < 






< 




—J 


o 




< c3 




h. 










>s < 








to 


< 














< 


o 














o 




to O 




w 






£ 


o 




>n a 




<: 






1— 


< 








k. 


o 




3 


o 




CD O 






o 




a> 


k— 




— 5t 






< 




_ i 


o 




— < 




CO 


o 




o 


o 




3 O 








w 






CD >— 




o 


»— 




o. 


o 




— 1 O 




a> 


o 








CO o 
















>v CJ 






< 




o 


o 




o *- 




as o 




CD 


o 




CD U 










JC 






— >— 




< 






a. 










3 






>s o 




«5 




CO 




O 






CO 




CM 






CO 




o 


CO 


<8 




a> 


o 




(0 






k. O 












o 




V C9 






5 




< 






C/> < 




3 






c 


O 




3 O 




a> 


1— 




to 






CD H- 




—i 






< 


% 




o 




CD 






to 






3 






1— 






< 




a> »— 






< 




z 


o 




^ o 














>s 




to 






CD 






— CJ> 




< 






3 


< 




i3 <3 






















o 






CT) 




CO 












CT) 




O 



BNSEXXID: <EP 0540065A1_I_> 



23 



EP 0 540 065 A1 



?! ll II § S5SS85s£g5g5555 

b: ,SJ oo<ooo<oooooooo 

£ y —< — H! o o^ooooo<oo<ooo< 

m ® £ ^ C S? ~ ° « S 00<^^<OOOOOOUOO 

2:5 ^ n; < 2 > H o <oooot-h-^oo£oo"£- 

0 " < ^ °, H ^ 9; 5^ < ^<»-oo>-oo*-ooooo#- 

£2 ° £f oo o oo<^o^o^oo£ooo£ 

^ < — 5 ^ <-> < <<Ol-<f-OOK-<000<0 



2Jf oO < ooJ-0<oo<oo3<ooS 

^ - - • fr h <-> o ouooo£<<ooooSoo 

3< Oh- « W> H- U U 19 tf »- k. tl u (i m n n m m r. 



— _ : ' • — » — v> ^ « ll «I ri fl ta— 

- - - - * H *r ooo<>-»-oh-oooooo»- 

• £r w <-> — o < ooo<o*-oi-<<o<o<o 

O OO 3 O < ^00<0<OOo355»-Sri 



o o 



>r5 §5 - g o o o S < o 2 S t ts 5 y 5 ^ o 5 < 

O U w O >* H; O 0000^<Ot-0<o3£<0< 

t H y — J2 H oo^<o*-»-oi-oo<<oo< 

go >sO >*o o o o o £ o I- o o fc- W < o o o o < 

H; OO OO O O O O O O £ £ O O 5 O O O O 2 < 

— o oo *o < o»-o^-oi-ooooJ-oooo5 

?g 5gS«a«sy"«to5^5 



CD 

— „x§ ^.<S £3 bSpSt^tt 552S8S| 

V— / r* <£ >*o wo o o o o o f= o S o S o o o 8 < < 

S8 88 SH88StSJf 5 

% < »r < ^o o i-i-»-o<i-t^ooh-h-o<<o< 

P £5 «y co wo ooooo»-o»-<oooooo< 

v^O — H> — < « O »-UO<<<Ol-0<0<000< 

o O CO< <i-t><o»-<oooouo<o^ 

< wo ouo<»-^d<>-o»-H:ooo 

rz o »o o<oooh-ooooooooo 



< o 
« o — o 



>*< o »o o<oooh-ooooooo — _ — 

-J< > o oo co < < o o o O I- o o < S o o o O o ^ 

C»0 OJO wO Oi< CJO<<l— h-OOO»-<0H-O»— 

t-O wO o> o wo oooo»-ooooooo<o^< 

< o o co < < < u<wooSooou<uooo5 
^2 3 S SS? £<j <oo<oi-So^ooo<<o;r 
5 S rz < ojr w< k-o<oo»-w-oooou>-»-o>- 

>o o O J O < < 000>*»-»-<t-OI*OO0^-O< 

WO to h- wh wo »-<»-.<OOh-00000<0<i- 

^» < ~Z H £i y » W> HOOOOOO»-l-Of-<0000 

»~ *T <0 h- < C^f- f-0-<w > 0l-w>oOW>O0<H0»>- 

T — O >s o ao »— w o 0<<t*»-O0OOOOO0O<w. 

co *o h- o> — o eg to < moo oooooo>-oooP-ooooo 

co>o cooo ^r<o ooo^oSo£ooooSo^<| 

« O WO 3 O O W> w. o < < O t- O O i- < O O V O ^ < 

^ < >*< wo 0<OOOOH»-»-000<00<| 

•=J5 < -J H O-O H I 00<0^-0<w><h-H-000»-l 

3< <C O >*0 wo O O C3 O O t— H- 0 00 0 0<t— H-<l 

© "S © o >-ooo"<C"ooooooo< 

-JO -<o OO CS>< o < < o < ^-<<<<ooo 

<0O w O 3 O 3 O l-OOOo£<£o<355o<< 

< o to < oo -I o oooo<<ooooooooo< 

30 05O w< wO OOOOUH- Ol— Ot-00<OL>t- 

— < 4,0 O O00<^0<0<0000<0< 

oo — < CO w. CO < 00<000<»-<Ol-0<000 



3NSOOCID: <EP 054006SA1_I_> 



24 



EP 0 540 065 A1 



FIG. 2A 

1 

m 



420 487 531 
J L 



777 



DNA 



Cortisol 



hGR 



NX 



Not I Xho I 

^^^^NA-binding Domain Switch 

87 154 198 



|DNAl 



431 462 

>}>)))>)}>)}>>>)}>>*) — 

%Retinoic Ac 



hRR 



NX 



Not I Xho I 



Chimeric Receptor 



87 154 198 



431 462 



DNA 



7Z 



?^^%Retinoic Acid^%%l \ hRGR 



FIG. 2B 



AC 



Tf tttttt tttt 

, - DEX RA J , ~ DEX RA | ] - DEX RA [ f — DEX RA ( 



pRShGR pRShRR pRShRR nx pRShRGR 



BNSDOCID: <EP 054O065A1_l_> 



25 



EP 0 540 065 A1 



FIG. 3A 




COMPETITOR: - RA R T 4 DEX VD 3 - RA R T 4 DEX VD3 



MOCK pRShRR 



BNSOOCID: <EP O&400BSA1 I > 



26 



EP 0 540 065 A1 




EP 0 540 065 A1 



Rat 



Human 



» 1 



S 



D 



o 
o 
a, 



9 8 § S 



z 5 

Q g 
2 o 



a 

o 

CO 

1 a 

2 ^ 

5 s ^ 
a, co 



z 

Cu 

co 



CO 



Cx3 



28S - 



18S 



FIGURE 5 



BNSOOCID: <EP 0540065A1J_> 



28 



EP 0 540 065 A1 



X 

o 
o 
o 

o 
oo 



o 

o -i 



DC 



o 

o • 
CO 



o 
o 

lO 



o 

o H 



o 

O "I 
CO 



o 
o 

CVJ 



o 
o 



CM 

X 



DC 
DC 
sz 



DC 

CO 



CNJ 

CD * 



vp 
LO 



to 



T3: 

o 

<: 
{O 

"5 

DC 



CO 

-in « 



NO 

CO 
00 



as 

C\i 
CO 



CO 



< 



BISISDOCIO <EP 0540065A 1 _l _> 



29 



EP 0 540 065 A1 



IU 



BNSDOCID: <EP 0540065A1_I_> 



30 



EP 0 540 065 A1 




EP 0 540 065 A1 




32 



4. 



European Patent 
Office 



EUROPEAN SEARCH REPORT 



Aaattcattoa Nnk 

EP 92 12 1951 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of i 



Relevant 



CLASSIFICATION OF THE 
APPLICATION Oat. CL4 ) 



D,X 



P,D, 

X 

Y 



D 



NATURE 

vol . 325, 1 January 1987, LONDON GB 
pages 75 - 78 

GREEN, S. & CHAMBON, P. 'Oestradiol 
induction of a g lucocorticoid-responsive 
gene by a chimaeric receptor 1 

* the whole document * 

WQ-A-8 803 168 (THE SALK INSTITUTE FOR 
BIOLOGICAL STUDIES) 

* the whole document * 

NATURE 

vol. 330, 3 December 1987, LONDON GB 
pages 420 - 421 

ROBERTSON, M. 'Towards a biochemistry of 
morphogenesis 1 

* the whole document * 

NATURE 

vol. 330, 17 December 1987, LONDON GB 
pages 624 - 629 

GIGUERE, V. ET AL. 'Identification of a 
receptor for the morphogen retinoic acid" 

* the whole document * 



The 



report has been er»wa op far all < 



1.3 



1-10 
1,3-10 
1-10 
1-10 



4-10 



C12N15/12 
C12N15/62 
G01N33/68 
G01N33/74 
C12N5/10 



TECHNICAL FIELDS 
SEAJtCHED (Int. CL4 ) 



C12N 
G01N 



THE HAGUE 



rr if m i if n i ■ 

01 MARCH 1993 



ANDRES S.M. 



CATEGORY OP 



DOCUMENTS 



i 

o 




T : theory or artadpU aaacrMag 
C : oorlter pataat inmawl, tot 

aftortaefUiagaase 
D : aoeaaMM dtoi la t 
L : aocaaoBt crtoi for otaar 



aaarof (fees 



BNSDOCID: <EP 0540065A1J_> 



